Add LLM inference support to JMLC API #2430

Draft

kubraaksux wants to merge 8 commits into apache:main from kubraaksux:llm-api


@kubraaksux

Summary

Adds LLM text generation to the JMLC API, using Py4J to bridge Java and a Python worker that loads HuggingFace models.

Changes

  • Connection.java: loadModel() to start Python worker
  • PreparedScript.java: setLLMWorker(), generate()
  • LLMCallback.java: Java interface for Python callback
  • llm_worker.py: Python worker loading HuggingFace models
  • JMLCLLMInferenceTest.java: Integration test
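
To make the intended wiring concrete, here is a minimal sketch of the Py4J callback pattern these pieces follow: Java starts a GatewayServer, the Python worker connects back and registers an object implementing the LLMCallback interface, and Java-side generate() calls are forwarded over that bridge. Apart from the names listed above (LLMCallback, generate()), the class names, signatures, and port details below are illustrative assumptions, not the PR's actual code.

```java
// Sketch only: the Py4J callback pattern assumed by this PR. Everything except the
// LLMCallback/generate() names is an illustrative assumption.
import py4j.GatewayServer;

public class LlmBridgeSketch {

    /** Java-side interface; in the PR this is the top-level LLMCallback.java. */
    public interface LLMCallback {
        String generate(String prompt);
    }

    private volatile LLMCallback worker;

    /** Called from Python (via Py4J) once llm_worker.py has loaded the HuggingFace model. */
    public void registerWorker(LLMCallback worker) {
        this.worker = worker;
    }

    /** Forwards the prompt to the Python worker over the Py4J bridge. */
    public String generate(String prompt) {
        if (worker == null)
            throw new IllegalStateException("Python LLM worker not registered yet");
        return worker.generate(prompt);
    }

    public static void main(String[] args) {
        LlmBridgeSketch entryPoint = new LlmBridgeSketch();
        // Connection.loadModel() would launch the Python process and start a server like this,
        // using an auto-discovered port rather than the Py4J default shown here.
        GatewayServer server = new GatewayServer(entryPoint);
        server.start();
    }
}
```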

Test

mvn test -Dtest=JMLCLLMInferenceTest -pl .

WIP - looking for feedback on the approach.

- Connection.java: Changed loadModel(modelName) to loadModel(modelName, workerScriptPath)
- Connection.java: Removed findPythonScript() method
- LLMCallback.java: Added Javadoc for generate() method
- JMLCLLMInferenceTest.java: Updated to pass script path to loadModel()
- Connection.java: Auto-find available ports for Py4J communication
- Connection.java: Add loadModel() overload for manual port override
- Connection.java: Use destroyForcibly() with waitFor() for clean shutdown
- llm_worker.py: Accept python_port as command line argument
- Move worker script from src/main/python/systemds/ to src/main/python/ to avoid shadowing the Python stdlib operator module
- Add generateWithTokenCount() returning JSON with input/output token counts
- Update generateBatchWithMetrics() to include input_tokens and output_tokens columns
- Add CUDA auto-detection with device_map=auto for multi-GPU support in llm_worker.py
- Check Python process liveness during startup instead of a blind 60s timeout (see the lifecycle sketch below)
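
The lifecycle items above (auto-found ports, a liveness check during startup, destroyForcibly() with waitFor() on shutdown) map to standard JDK calls. A minimal sketch follows, with hypothetical helper names; the actual implementation in Connection.java may differ, and the readiness probe is only indicated as a comment.

```java
// Sketch of the worker lifecycle techniques listed above: free-port discovery,
// liveness-aware startup wait, and forced shutdown. Helper names are assumptions.
import java.io.IOException;
import java.net.ServerSocket;

public class WorkerLifecycleSketch {

    /** Auto-find an available port by binding to port 0 and reading back the assigned port. */
    static int findFreePort() throws IOException {
        try (ServerSocket socket = new ServerSocket(0)) {
            return socket.getLocalPort();
        }
    }

    /** Wait for the worker while checking that the Python process is still alive,
     *  instead of sleeping blindly for 60 seconds. */
    static void awaitWorkerReady(Process pythonWorker, long timeoutMillis) throws InterruptedException {
        long deadline = System.currentTimeMillis() + timeoutMillis;
        while (System.currentTimeMillis() < deadline) {
            if (!pythonWorker.isAlive())
                throw new IllegalStateException("Python worker exited during startup, exit code "
                    + pythonWorker.exitValue());
            // ... attempt the Py4J handshake here and return on success ...
            Thread.sleep(200);
        }
        throw new IllegalStateException("Timed out waiting for the Python worker to become ready");
    }

    /** Clean shutdown: force-kill the worker and block until it has actually terminated. */
    static void shutdownWorker(Process pythonWorker) throws InterruptedException {
        pythonWorker.destroyForcibly();
        pythonWorker.waitFor();
    }
}
```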